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Abstract. We prove that the Random-Edge simplex algorithm requires an 
expected number of at most 13n/^/d pivot steps on any simple d-polytope with 
n vertices. This is the first nontrivial upper bound for general polytopes. We 
also describe a refined analysis that potentially yields much better bounds for 
specific classes of polytopes. As one application, we show that for combinatorial 
d-cubes, the trivial upper bound of 2 d on the performance of Random-Edge 
can asymptotically be improved by any desired polynomial factor in d. 



1. Introduction 

Dantzig's simplex method 18. is a widely used tool for solving linear programs 
(LP). The feasible region of an LP is a polyhedron; any algorithm implementing the 
simplex method traverses a sequence of vertices, such that (i) consecutive vertices 
are equal (the degenerate case) or connected by a polyhedron edge, and (ii) the 
objective function strictly improves along any traversed edge. In both theory and 
practice, we may assume that some initial vertex is available, and that the optimal 
solution to the LP is attained at a vertex, if there is an optimum at all. It follows that 
if the algorithm does not cycle, it will eventually find an optimal solution, or discover 
that the problem is unbounded (see e.g. Chvatal's book [7] for a comprehensive 
introduction to the simplex method). 

For most (complexity-)theoretic investigations, one can safely assume that the 
LP's that are considered are bounded as well as both primally and dually non- 
degenerate [TSl . Thus, we will only deal with simple polytopes, i.e., bounded d-Ai- 
mensional polyhedra, where at each vertex exactly d facets meet, and with objective 
functions that are non-constant along any edge of the polytope. 

The distinguishing feature of each simplex-algorithm is the pivot rule according 
to which the next vertex in the sequence is selected in case there is a choice. Many 
popular pivot rules are efficient in practice, meaning that they induce a short vertex 
sequence in typical applications. The situation in theory is in sharp contrast to this: 
Among most of the deterministic pivot rules proposed in the literature (including 
the ones widely used in practice), the simplex algorithm is forced to traverse ex- 
ponentially (in the number of variables and constraints of the LP) many vertices 
in the worst case. It is open whether there is a pivot rule that always induces a 
sequence of polynomial length. 

To explain simplex's excellent behavior in practice, the tools of average case 
analysis [5] and smoothed analysis |20j have been devised, and to conquer the worst 
case bounds, research has turned to randomized pivot rules. Indeed, Kalai |13[ I14j 
as well as Matousek, Sharir and Welzl [17] could prove that the expected number of 
steps taken by the Random-Facet pivot rule is only subexponential in the worst 
case. These results hold under our above assumption that the feasible region of the 
LP is a simple and full-dimensional polytope. 
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Much less is known about another (actually, the most natural) randomized pivot 
rule: choose the next vertex in the sequence uniformly at random among the neigh- 
bors of the current vertex with better objective function value. This rule is called 
Random-Edge, and unlike Random-Facet, it has no recursive structure to peg an 
analysis to. Nontrivial upper bounds on its expected number of pivot steps on gen- 
eral polytopes do not exist. Results are known for 3-polytopes [6j[T2], d-polytopes 
with d + 2 facets [5] , and for linear assignment problems [H] . Only recently, Pe- 
mantle and Balogh solved the long standing problem of finding a tight bound for 
the expected performance of Random-Edge on the d-dimensional Klee-Minty cube 
[3]. This polytope is the 'mother' of many worst-case inputs for deterministic pivot 
rules PS 

None of the existing results exclude the possibility of both Random-Facet and 
Random-Edge being the desired (expected) polynomial-time pivot rules. In the 
more general and well-studied setting of abstract objective functions on polytopes 
P (221 Ell Q3] , superpolynomial lower bounds are known for both rules, where the 
construction for Random-Edge [18] is very recent and much more involved than the 
one for Random-Facet [16]. Both approaches inherently use objective functions 
(on cubes) that are not linearly induced. 

In this paper, we derive the first nontrivial upper bound for the expected perfor- 
mance of Random-Edge on simple polytopes, with edge orientations induced by 
abstract objective functions. Even when we restrict to linear objective functions on 
combinatorial cubes, the result is new. The general bound itself is rather weak and 
also achieved for example by the deterministic Greatest-Decrease rule. The 
emphasis here is on the fact that we are able to make progress at all, given that 
Random-Edge has turned out to be very difficult to attack in the past. Also, our 
new bound separates Random-Edge from many deterministic rules (for example, 
Dantzig's rule, Bland's rule, or the shadow vertex rule) that may visit all vertices 
in the worst case [2]. 

In a second part, we refine the analysis, with the goal of obtaining better bounds 
for specific classes of polytopes. Roughly speaking, these are polytopes with large 
and regular local neighborhoods. Our prime example is the class of combinatorial 
cubes, for which we improve the general upper bound by any desired polynomial 
factor in the dimension. As before, this also works for abstract objective functions 
and thus complements the recent lower bound of Matousek and Szabo [TH] with a 
first nontrivial upper bound. 

2. A Bound for General Polytopes 

Throughout this section, P is a d-dimensional simple polytope with a set V of n 
vertices. A directed graph D = {V, A) is called an acyclic unique sink orientation 
(AUSO) of P if 

(i) its underlying undirected graph is the vertex-edge graph of P, 

(ii) D is acyclic, and 

(iii) any subgraph of D induced by the vertices of a nonempty face of P has a 
unique sink. 

Any linear function ip : V — > IR that is generic (non-constant on edges of P) induces 
an AUSO in a natural way: there is a directed edge v — > w between adjacent vertices 
if and only if (f(v) > (f{w). The global sink of the AUSO is the unique vertex that 
minimizes ip over P. If ip is any generic (not necessarily linear) function inducing 
an AUSO that way, ip is called an abstract objective function. For a given AUSO 
D of P, any function (p that maps vertices to their ranks w.r.t. a fixed topological 
sorting of D is an abstract objective function that induces D. In general, D need 
not be induced by a linear function, for example if D fails to satisfy the necessary 
Holt-Klee condition for linear realizability |llj . For the remainder of this section, 
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we fix an AUSO D of P, an abstract objective function <p that induces D and some 
vertex s G V. 

Let tt be the random variable defined as the directed path in D, starting at s 
and ending at the sink i> opt of D, induced by the Random-Edge pivot rule. From 
each visited vertex v ^ w pt, ^ proceeds to a neighbor w of v, along an outgoing 
edge chosen uniformly at random from all outgoing edges. 

For each v £ V, denote by 

out(u) := {w e V : (v,w) G A} 

the set of all smaller (w.r.t. <p) neighbors of v. If | out(u)| = fc, then v is called a 
k-vertex. We denote by 14 the set of all k- vertices. 

For every vertex v ^ v opt on the path tt let v' be its successor on tt. We denote 

by 

S(v) := {w € out(w) : ip(v') < <p{w)} , 
the set of neighbors of v that are 'skipped' by tt at the step from d to «'. For every 
< k < d let 

%(tt) := \{v E vrn V k : > L^^Jll 

be the number of fc-vertices on tt, where tt skips at least |_§ J neighbors. (Here, as 
in the following, we write, depending on the context, V for the set of vertices on 
the path tt.) 

If we denote by nk(rr) the total number of k- vertices on the path tt, then we 
obtain 

(1) EMtt)] > §E[n fe (7r)] . 
Indeed, we have 

EfabM] = E P[«€7rand|5(«)|>L|J] 

and 

EMtt)] - X] P[l ' G ^ • 
The claim then follows from 

P[\S{v)\>[^pi\\vGTT] > |. 

Due to > </?(w) > <p(v') for all w € S{v), the sets 5(u) are pairwise disjoint. 
Thus, we obtain (exploiting the linearity of expectation) for the number length(7r) 
of vertices on tt 

d d 

E[lcngth(7r)] < n-^EfokMlLfj < n - ^ sL|J E [ n *W] 

fc=0 fe=0 

(where we used (|T]) for the second inequality). Clearly, we have E [length (71")] = 
T,k=o E [nk(ir)l Therefore, we obtain (note |L|J > ^r 1 ) 

d d 

(2) E[length(vr)] < min { ^ E[n*(7r)], n - ^ ^E[n fc (7r)]} . 

fc=0 fc=0 

If /ife denotes the total number of fc-vertices in V, then we clearly have < 
EK(tt)] < h k . Thus, © yields 

d <j 

(3) E[length(7r)] < max {min{^~] xu, n — : < xu < /ijt for all fc} . 

fc=0 k=0 

In (|3|), the maximum must be attained by some x S R d+1 for which the minimum 
is attained by both ^ Xfc andn — X) ^T^ x fe- Indeed, if^Xfc < n— ^x^fffc then not 
all Xfc can be at their respective upper bounds hk (since n = ^h^), thus one of them 
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can slightly be increased in order to increase the minimum. If^Xk > ^^%k 
then not all x k can be zero (since this would yield > n), so one of them can be 
decreased in order to increase the minimum. Thus we conclude 

d d 

(4) E[length(vr)] < max {J^ x k : ^2 = n, < x k < h k for all k) . 

fc=0 k=0 

By (weak) linear programming duality (and exploiting n = Y] k _ h k once more), 
we can derive from Q the estimate 

d 

(5) E[length(7r)] < £ h k ■ max{y, 1 - ^y} 

k=0 

for every i/eR. 

In the sequel, we need two important results from the theory of convex polytopes. 
The parameters h k are independent of the actual acyclic unique sink orientation of 
the polytope. The h-vector formed by them is a linear transformation of the /- 
vector of the polytope, storing for each i the number of i-dimensional faces of the 
polytope. 

The first classical result we need are the Dehn-Sommerville equations 

(6) h k = h d _ k for all < k < d 

(see [24] Sect. 8.3]). The second one is the unimodality of the h-vector: 

(7) h < hi < ... < hy d/2 ^ 

The latter is equivalent to the nonnegativity of the g-vector, which is one of the 
hard parts of the g-theorem for simplicial polytopes, see |24| Sect. 8.6]. 
From (|6|) and ((7]) we can derive 

d _ 

n = J2 h k > { d -%^d) h yiVd\ > 

fc=0 

which yields (for d > 64) 

(8) < 



d-8Vd ' 

Now we choose y :— 1/yd in ([5|). We have 

1 k — 1 r- 

—r=>l =- <3> k > 4Vd - 3 . 

Vd \\[d 

Thus, © (with y = 1/Vd) gives 

(9, Ellens e £ 

By the unimodality of the h- vector and ([8]), the first sum in ([9]) can be estimated 
by ' 

n , 4n 12rt 

Clearly, the second sum in ([9]) is bounded by nj\fd. The resulting total bound 
of \2>nj\fd also holds for d < 144, because n is a trivial upper bound. Thus we have 
proved the following result. 
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Theorem 1. The expected number of vertices visited by the Random-Edge simplex- 
algorithm on a d- dimensional simple polytope with n > d+1 vertices, equipped with 
an abstract (in particular: a linear) objective function is bounded by 

Vd 

A similar analysis reveals that the running-time for the Greatest-Decrease- 
rule is bounded by 

C ■ — 

In each step, this rule selects the neighboring vertex with smallest tp-vaiue, thus 
skipping all other neighbors of the current vertex v. 

For general simple polytopes, our analysis of the bound for Random-Edge 
stated in ([3]) is essentially best possible. This can be seen through the examples of 
duals of stacked simplicial polytopes (see, e.g., [J)), which are simple d-polytopes 
with n vertices, ho = hd = 1, and h^ = f° r all 1 < A; < d — 1. 

3. A Bound for Cubes 

The core argument of the analysis presented in Section [5] is the following: For 
every vertex on the Random-Edge path tt with out-degree k we know that tt 
skips (in expectation) fc/2 vertices in the single step from v to its successor. Wc 
then exploited the Dchn-Sommervile equations as well as the unimodality of the 
/i-vector in order to argue that many vertices on tt must have large out-degree - 
unless tt is 'short' anyway. 

For the d-dimensional cube, we have much more information on the ft-vector: 
hk = (^) for every k. Thus, 'most' vertices have out-degree roughly d/2 in case of 
cubes. We will exploit this stronger knowledge in a sharper analysis for cubes, which 
relies on studying larger structures around vertices than just their out-neighbors. 
We actually do the analysis for general polytopes and obtain a bound on the ex- 
pected path length in terms of two specific quantities. Later we bound these quan- 
tities for the case of cubes. 

3.1. The General Approach. Within this subsection, (as in Section[2|), let P be 
a d-dimensional simple polytope with n vertices V, D = (V, A) an AUSO of P, 
ip : V —> R an abstract objective function inducing D, and s € V a fixed vertex. 
We denote by dist^(i>, w) the length (number of arcs) of a shortest directed path 
from v to w (dist^(w, w) may be 00 if there is no such path). 

Definition 1 (i-reach). Let t, k € IN and v E V. 

(1) We call 

Rt{v) := {w e V : dist~*(v,w) < t} 

the t-reach of v. The boundary ofKt{v), denoted by <9R t (u), is the set of 
all w G Kt(v), for which there is a directed (not necessarily shortest) path 
of length precisely t from v to w. 

(2) The t-reach Rt(w) is fc-good if 

\out(w)\ > k 

holds for all w G R*(w) with dist^(v, w) <t—l. 

(3) A vertex v is (t, fc)-good if its t-reach is k-good. The set of all (t, k)-good 
vertices is denoted by G(t,k). 

In particular, if v is (t, A:)-good, the optimal vertex w opt may occur in the bound- 
ary of Rt(i>), but not in its interior. For t,k G N, we define 

g{t,k) := mm{\dR t {v)\ : v&G(t,k)}. 
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For every vertex v and some tg N, denote by (the random variable) w t (v) 
the vertex that is reached by the Random-Edge simplex-algorithm, started at v, 
after t steps (let Wt(v) :— i> op t in case the sink is reached before step t). Generalizing 
the notion from Section [2j we denote by 

S t (v) := {u£R t (v) : <p(u) > ip{w t {v))} 

the set of vertices in Rt(i>) left behind while walking from v to Rt(u). 

Lemma 1. For every t, k £ tKl and v G G(t, k), we have 

p[ |S t(w) |>£(|M] > 

Proof. Let dHt(v) — {u±, . . . , u g } with <p(ui) > . . .ip(u q ). By construction, there 
is some i* with Wt(v) = U{*. Since the outdegree at every vertex is at most d, we 
have 

P[i* = i] > i 

for every 1 < i < q. Therefore, 

P[i* > 3/2] > ^ 

holds. Since q > g(t, k) holds and because i* > g(t, k)/2 implies > g(t, k)/2, 

the claim follows. 

□ 

Now let us consider the path it followed by the Random-Edge simplex-algorithm 
started at s (ending in v op t). For t, k S N with t > 2 and k > 1, we subdivide it into 
subpaths with the property that every subpath either has length one and starts at a 
non-(i, fc)-good vertex or it has length t (a long subpath) and starts at a {t, fc)-good 
vertex. (Such a partitioning is clearly possible.) 

Let n tt k(^) be the number of long subpaths in our partitioning. We denote the 
pairs of start and end vertices of these long paths by (xi, y\), ... (x„ t h f n \, y nt h r n \)- 
Let 

S t {xi) := {« £ R((i,) : ip(u) > cp(yi)} 

and define 

VM ■= |{ie{l,... 1 n t , i k(7r)} : \S t (xi)\ > ^}| 

to be the number of those long subpaths which leave behind at least S^M. vertices 
from YL t (xi). 

Using Lemma [T] (note that St(xi), conditioned on the event that Xi is the start 
vertex of a long subpath in the partitioning of 7r, has the same distribution as 
St(xi)), we can deduce, similarly to our derivation of |T]), the following: 

(10) E[Tb lJk (7r)] > £M E[ntiJfe(7r )]. 

Also here, the sets S t (xi) (for 1 < i < n tjfe (7r)) are pairwise disjoint. Thus, for 
each long subpath (consisting of t arcs) starting at some Xi with |S 4 (a;j)| > g(t, k)/2 
we can count at least g(t, k)/2 — t vertices that are not visited by it. Therefore, we 
can conclude 

E[length(7r)] < n - - t)E[ m , k (*)] ■ 

Using (fTU)l and defining 

g(t,k) := (ii^L-t)^, 

this yields 

(11) E[length(7r)] < n - g(t, *)E[n tjfc (7r)] . 
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On the other hand, denote by 

(12) f(t,k) := \V\G(t,k)\ 

the total number of non-(i, /c)-good vertices. From the definition of our path parti- 
tioning, we immediately obtain 

(13) Epength(Tr)] < f(t, k)+t- E[n t , k (jr)] . 

Adding up nonnegative multiples of (|11[) and (flU)) in such a way that E[n ti fc(7r)] 
cancels out, one obtains the following bound: 

, / \ i tn + g(t,k)(f(t,k)) t 

E length rr < ! , < ^7TTrn + f(t,k) 

g(t,k)+t g(t,k) 

This yields the following estimation. 

Lemma 2. For t, k G IN urei/i i > 2 and k > 1, we have 

E[length(7r)] < n + /(*, fc) . 

g(t,k){g(t,k) - 2t) 

A general way to bound the function /(£, fc) is as follows. 

Lemma 3. For t, k € IN, we /law 

d* - 1 

/(t,*t) < -7-^^<fc , 

a— 1 

where h < k := X)i=o * s ^ e nurn ^ er of vertices with outdegree less than k. 
Proof. If w € y \ G(i, fc), then there is some w S Rt_i(?;) with | out(u>)| < k. On the 
other hand, each w is contained in at most X)i=o ^ = ITT — l)-reaches (since 
the undirected graph is d-regular). The claim follows. □ 

The following describes a way of bounding the function g(t, k) by studying the 
undirected graph of the polytope. 

Definition 2 ((i, /^-neighborhood, j(t,k)). Let t, k E N. 

(1) A subset N C V is called a (t, fc)-neighborhood of v S V if N — {v} in 
case of t = 0, or, if t > 1, there are k neighbors W\, of v in the 
graph of P together with (t — 1, k) -neighborhoods N±, . . . , of w±, . . . , 
Wk, respectively, such that N — |Jj =1 Ni. 

(2) We define j(t,k) as the minimum cardinality of {w € N : dist(u, u>) = t}, 
taken over all v 6 V and all (t, k) -neighborhoods of v. (Here, dist(w,u>) 
denotes the graph-theoretical distance between v and w in the undirected 
graph of P.) 

If v is (t, fc)-good, then it follows right from the definitions that the boundary 
dRt(v) of its i-reach contains a (t, /^-neighborhood N of v. In particular, all vertices 
w € N with dist(w, w) — t are in dR t (v), and these are the ones of use to us. 

Lemma 4. For t, k € IN with t > 2, we have 

g(t,k) > j(t,k) . 

3.2. Specialization to Cubes. In order to obtain from Lemma an explicit 
bound for the expected number of vertices visited by the Random-Edge simplex- 
algorithm on the d-cube, we will derive estimates on the functions f(t, k) and g(t, k) 
for k= Lfj. 

Lemma 5. There is a constant < a < 1 such that 

f(t, L|J) < 2 ad+o{d) 

holds for allt € N ( where f is the function defined in H2\) for the case of the d-cube, 
and with k = 
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Proof. In the case of a d-cube and fc = , we have 

L-J-i 

h <k = J2 (f) = ^ )d+o[d) , 

where h(x) = x\og — + (1 — x) log is the binary entropy function (see, e.g., jTUl 
Chap. 9, Ex. 42]). By Lemma [3] this implies the claimed bound (with the o(d) term 
depending out). □ 

The final building block of our bound for the special case of cubes is the following. 
Here, we denote by a- (falling factorial power) the product a(a — 1) • • • (a — b + 1 ) 
(for a, be N). 



Lemma 6. Let t, k G IN with 1 < t, k < d. If the polytope P considered in Section \3A 
is a d-cube, then the following is true: 



fc* sAk 1 ( d-1 



i=l v 

(2) If t is a constant, then j(t, = f2(d*). 

Proof. Part (2) follows immediately from part (1), since the sum becomes a poly- 
nomial in <i of degree t — 1 for fc = | (and constant i) . 

Let us prove (1) for each fixed k, by induction on t, where the case t = 1 holds 
due to 7(1, fc) = fc. Thus, let us consider the case t > 2. 

We may assume that the vertex v and its neighbors w\, . . . , Wfc, for which the 
minimum j(t, fc) is attained, are v = and tUj = <Bj (1 < i < fc). For each i, the 
(< — 1, fc) neighborhood Ni of has at least j(t ~ 1, fc) vertices w with dist(ei, w) — 
t — 1, by definition. All of them have distance t — 2 or t from (D. The former may 
be the case at most (^2) times (these vertices cannot have a one at position i). 
Therefore, we have 



\{w £ Ni : dist((D, w) — t}\ > j(t-l,k) 



d-l 
t-2 



On the other hand, every vertex w S Ni with dist((D, w) —t needs to have a one at 
position i (otherwise, dist(ei, w) — t + 1). Hence, every vertex w with dist((D, w) = t 
can be contained in at most t of the neighborhoods Ni, . . . , N k ■ Thus, we conclude 
(for t > 2) 

and thus, 

fc Ktl) 

(14) 7 (t,fc) > - 7 (t-i,A:)--^ . 

Using the induction hypothesis and ()14[) we derive 

fc t_1 

7 (t,fc) > " ' 




which, after an index shift in the sum, yields the claim. □ 



Now we can prove our main result: 



TWO NEW BOUNDS FOR THE RANDOM-EDGE SIMPLEX ALGORITHM 



9 



Theorem 2. For every fixed t G IN, there is a constant C' f G R (depending on t), 
such that the expected number of vertices visited by the Random-Edge simplex- 
algorithm on a d-dimensional cube, equipped with an abstract (in particular: a lin- 
ear) objective function, is bounded by 

2 d 

Proof. Let it be the (random) path (for some arbitrary start vertex) defined by the 
Random-Edge simplex-algorithm on a d-cube equipped with an acyclic unique 
sink orientation. By Lemma [2l we have, with d! := |_|_|, 

(15) E[length(7r)] < ^ 2 d + f(t,d>). 

g(t, d'){g{t,d') - 2t) 

From Lemma [5] we know that there is some constant < a < 1 with 

(16) f(t,d') < 2 ad+ °W . 

Finally, by Lemmas 0] and [5] (2) there is some constant (3 > such that 

(17) g(t,d') > (id 1 . 
Putting (fl~5|) . (fl~6|) . and ((TT|) together, we obtain 

Afflt 

* ^-2 m 2d + rd+ ° {d) > 
which implies the claim. □ 



4. Conclusion 

Probably one can extend the methods we have used for analyzing Random-Edge 
on cubes to other classes of polytopes (e.g., general products of simplices). However, 
it seems to us that it would be more interesting to find a way of sharpening our 
bounds by enhancing our approach with some new ideas. As mentioned at the end 
of Section^ the analysis of our approach is sharp in the general setting. We suspect 
that one cannot prove a subexponential bound for Random-Edge on cubes with 
our methods. Therefore, it would be most interesting to find a way of combining 
our kind of analysis with some other ideas. 
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